CALT 68-2447 
August 2003 



Sensitivity of CKM fits to theoretical uncertainties 
and their representation* 

G. P. Dubois-Felsmann, D. G. Hitlin, and F. C. Porter 
California Institute of Technology, Pasadena, CA 91125 USA 



O 
O 

(N 

Q-r 

<D 
C/3 

(N 
(N 

> 
(N 
VO 
(N 
00 

O 
m 
O 

a: 
i 



X 



G. Eigen 
University of Bergen, Bergen, Norway 

There is now a rapidly growing body of experimental data relevant to the question of whether 
the standard model CKM quark mixing matrix is a correct description of CP-violation as well as of 
non-CP-violating flavor decay processes. In the detailed comparisons with theoretical predictions 
that are required to investigate this, a key challenge has been the representation of non-statistical 
uncertainties, especially those arising in theoretical calculations. The analytical procedures that have 
been used to date require procedural value judgments on this matter that color the interpretation of 
the quantitative results they produce. Differences arising from these value judgments in the results 
obtained from the various global CKM fitting techniques in the literature are of a scale comparable 
to those arising from the other uncertainties in the input data and therefore cannot be ignored. 

We have developed techniques for studying and visualizing the sensitivity of global CKM fits to 
non-statistical uncertainties and their parameterization, as well as techniques for visual evaluation 
of the consistency of experimental and theoretical inputs that minimize the implicit use of such 
value judgments, while illuminating their effects. We present these techniques and the results of 
such studies using recently updated theoretical and experimental inputs, discuss their implications 
for the interpretation of global CKM fits, and illustrate their possible future application as the 
uncertainties on the inputs are improved over the next several years. 



INTRODUCTION 

The three-generation Cabibbo-Kobayashi-Masakawa 
(CKM) quark mixing matrix is a key feature of the stan- 
dard model, with rich phenomenological consequences. 
Testing the consistency of different ways of measuring 
the matrix elements and the consistency of this matrix 
with unitarity in three dimensions has become an indus- 
try, and, as the precision of such tests improves, may 
lead to the observation of phenomena outside of the stan- 
dard model. The recent experimental observation of CP- 
violating asymmetries in B decays presents us with the 
hope to investigate the CP-violating phase of the CKM 
matrix from many perspectives, and further presents a 
new source of constraints on the unitarity aspect. A 
detailed answer to the consistency question requires a 
comprehensive comparison of a great deal of experimen- 
tal data with corresponding theoretical predictions. We 
present here an approach to investigate and visualize the 
state of our knowledge on this matter. 

Motivated by rapidly improving experimental knowl- 
edge, there has been substantial effort towards deriving 
statistics for describing the self-consistency of the stan- 
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dard model in the quark sector. A number of methods 
for performing such global fits have been presented in the 
literature over the past decade, including the so-called 
"scanning method" [lj , which delineates a region of con- 
sistency of parameters, but does not attempt to ascertain 
a "best value" of the CKM parameters. Several of the 
more recently developed techniques 0, Q aim to produce 
quantitative measures of the overall consistency of the 
experimental data and theoretical model, and to put for- 
ward preferred values for the CKM matrix elements. 

A key challenge in the development of these procedures 
has been the representation in the fits of non-statistical 
uncertainties, especially on theoretical parameters such 
as the B meson pseudoscalar decay constant fs and the 
"bag factor" Br arising in K meson decay calculations. 
Calculations of these parameters are generally published 
with "uncertainty ranges" reflecting the authors' degree 
of belief in the correctness of model assumptions or the ef- 
fects of ignoring higher-order terms in expansions. These 
do not in general have a precise statistical definition (ex- 
cept for those contributions arising from statistics-limited 
Monte Carlo or lattice calculations) , and there is no clear 
consensus in the community for the meaning to be at- 
tached to their precise numerical value. 

Nevertheless, in order to perform a "global fit" that in- 
corporates inputs with such uncertainties, in the frame- 
work of a standard minimization-of-deviations fitting 
procedure, the goodness-of-fit metric to be used must 
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somehow be constructed to give them quantitative ef- 
fect. Because of the lack of a clear statistical meaning 
for these uncertainties or a consensus on how to interpret 
them, the schemes used to do this have been a subject of 
considerable debate in the literature, and the procedu- 
ral value judgments involved inevitably color the inter- 
pretation of the quantitative results obtained from such 
analyses. 

Differences arising from these value judgments in the 
results obtained from the various global CKM fitting 
techniques in the literature have been shown to be of a 
scale comparable to the effects of the other uncertainties 
in the input data, and therefore cannot be ignored Q. 

In the fitting procedures published to date, these 
choices tend to be embedded in the details of the anal- 
ysis, and their effects are not readily made manifest in 
a studvof the outputs of the fit. For example, Ciuchini 
et al. ,2j take a Bayesian approach, choosing a uniform 
probability density function (p.d.f.) when "the parame- 
ter is believed to be (almost) certainly in a given interval, 
and the points inside this interval are considered equally 
probable" (italics ours). The fact is that values of a the- 
oretical parameter within an uncertainty range of this 
type may or may not be equally probable: their distribu- 
tion is generally a priori unknown. Thus their choice of a 
priori p.d.f. is in fact a considerably stronger statement 
than saying "the value should lie within this range" , and 
than the original theoretical author may have intended. 
It is when this p.d.f. is then convolved with others in the 
course of the fit that particular problems can arise. 

This is easily seen by considering the case of a sin- 
gle hypothetical positive-definite parameter £. Imagine a 
theoretical prediction that Q should lie between £1 and C2, 
based perhaps on the range of values yielded by taking a 
few different approaches to an approximate calculation. 
One would expect such a prediction to have the same 
intellectual content as one stating that £ 2 should lie be- 
tween £ 2 and ffi Y e t hi the Bayesian method, this is not 
so. The meaning of a flat p.d.f. depends on the choice of 
the actual form of the parameter in which its distribution 
is expected to be uniform. Whether this is chosen to be 
in terms of ( or £ 2 — a question to which there may not be 
a principled answer — can clearly introduce a differential 
bias in what values of £ are preferred in the resulting fits. 

This illustrates what we believe to be a serious and 
intrinsic weakness in the Bayesian approach. 

Hocker, et aL's approach to treating theoretical un- 
certainties is that of frequentist statistics. In their "RFit" 
scheme, they explicitly do not impute a uniform p.d.f. to 
theoretical uncertainties. Rather, they include in their 
likelihood fits an unnormalizcd penalty function, not in- 
terpretable as a p.d.f., which entirely bars values of the- 
oretical parameters outside their allowed ranges, while 
having no effect inside the range. This effectively re- 
places the Bayesian method's convolution integral over 
a theoretical parameter's p.d.f. with a logical OR. The 



consequence is that when an overall likelihood is formed 
the results at a point a in the parameter space should be 
interpreted as an "upper bound of the confidence level 
one may set on a, which corresponds to the best possible 
set of theoretical parameters". This avoids problems of 
the sort mentioned above. 

It retains the disadvantage that, in the graphical out- 
puts generated by the method, points lying on the same 
contour — say, 95% — of the "upper bound of the confi- 
dence level" can appear as somehow similar to each other. 
It requires considerable discipline to avoid reading such 
a graph as stating that points on the same contour are 
in fact "equally likely". Yet, to the extent that the best- 
fit results for two such points arise from different val- 
ues of the theoretical parameters, they are not. Such a 
statement, we maintain, simply cannot be made for non- 
probabilistic theoretical uncertainties. 

This critique of the RFit frequentist analysis primarily 
relates to the interpretation of results and the need for 
care in presentation. While this may seem an unduly 
psychological point, in practice it can be observed that 
independent readers of its results frequently fail to retain 
the qualification that its results are "upper bounds" on 
confidence levels. 

Our principal concern, however, which applies to 
both approaches, is the degree to which experimental 
uncertainties, probabilistic and typically approximately 
gaussian, are intertwined with theoretical uncertainties 
with no principled probabilistic interpretation. We be- 
lieve that this obscures the importance of distinguishing 
among sources of uncertainty and lends an undue patina 
of statistical precision to the numerical results obtained. 
This may result in inappropriate conclusions on the sig- 
nificance of results obtained from CKM fits, and on ques- 
tions of such central importance as the consistency of 
data with the standard model. 

We present herein techniques for ascertaining the sen- 
sitivity of global CKM fits to non-probabilistic uncer- 
tainties and their representation. These include novel 
techniques for visual evaluation of the consistency of ex- 
perimental and theoretical inputs, which minimize the 
implicit use of value judgments concerning such uncer- 
tainties, while illuminating their effects. 

These ideas are of general applicability; in this paper, 
however, we concentrate on the information available for 
the determination of the point (p, 77), the apex of the 
conventional Unitarity Triangle. Our approach relies on 
a careful separation of the types of uncertainties in the 
inputs to the analysis. We use a fitting procedure that 
avoids the need for Bayesian p.d.f. or likelihood repre- 
sentations of non-probabilistic uncertainties, and, in a 
frequentist framework, defers the application of the the- 
oretical inputs until the stage of presentation and inter- 
pretation of the fit outputs. 

This allows us to develop a clearer understanding of 
the role of these inputs in determining our present and 
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future knowledge of p and r\ and the consistency of theory 
with experiment. 



THEORETICAL AND EXPERIMENTAL 
BACKGROUND 

For the purposes of this analysis, we initially assume 
the standard model CKM matrix to be a comprehensive 
description of weak flavor physics (and therefore unitary), 
and the only source of CP- violation. We are then able to 
adopt a standard parameterization of it that extends the 
one of Wolfenstein [j| to higher order in A |(| , as shown in 
Eqn. [3 below. This parameterization is sufficiently pre- 
cise for present purposes. As is conventional, we define 
p=p-(l- A 2 /2) and fj = rj ■ (1 - A 2 /2). 

We take the parameter A, which determines \V US \ and 
the rest of the light-quark sector of the CKM matrix, 
as an input, A = 0.2241 ± 0.0033, as reported in a re- 



cent combined analysis of light-quark sector data |^ . We 
combine a variety of other measurements to constrain the 
matrix elements involving the third generation, V c b, V u b, 
V t d, and V ts - 

The magnitude of the V c b CKM matrix element can be 
measured using either inclusive or exclusive techniques. 
It has been measured_in the exclusive decay B — ► D*£v, 
where I = e or [i 
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Gf is the Fermi constant, JC(w) is a precisely known 
kinematic factor, and Td» (w) is a theoretically uncertain 
form factor. The measured quantities which are used in 
determining \V e b\ are the branching fraction and the B 
lifetime tb ■ Extrapolating the fit to w = 1 results in the 
measurement of \V c b\ Fd* (!)■ 
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The form factor .Fd»(1), which is calculated at zero 
recoil and in the heavy quark limit is identical to one, 
contributes an estimated theoretical uncertainty corre- 
sponding to a range in V c b of (40.2 - 43.8) x 10~ 3 

For B — > D*iv the hadronic form factor at zero recoil 
is approximately given by Q 

FD'(1) = VAVQED(l + S 1/m 2 +S 1/m 3 + ...), (3) 

where tja represents a short-distance correction aris- 
ing from the finite QCD renormalization of the flavor- 
changing axial current at zero recoil, t]qed — 1.007 de- 
notes QED corrections in leading-logarithmic order, and 
the 8i/ m n are higher order corrections in powers of the 
6-quark mass. Note that the first order is missing due 
to Luke's theorem [loT |. The exact two-loop expression 
yields r) A = 0.96 ± 0.007 0. 

For the higher-order power corrections, Ref. 0] cal- 
culated a range of —0.08 < 5x/m 2 < ^0.03 which agrees 
with the result of 8\/ m 2 = —0.055 given in Ref. [l3j |. 
Only at order 6i/ m 3 do the predictions differ. While in 
Ref. this is accounted for in 5i/ m z, an extra contri- 
bution is estimated from sum rules in Ref. |l3|. yielding 
Si/ m 3 — —0.03. Plugging these values into Eqn. |3| yields 

T D *{1) = 0.913±0.007 pert ±0.024 1/m 2±0.011 1/m 3. (4) 

The errors denote theoretical uncertainties from pertur- 
bative QCD, from 1/m 2 , and from 1/m 3 terms, respec- 



tively. The central values differ by ~ 3%, while the theo- 
retical uncertainties added in quadrature amount to 2.7% 
and 5%, for the two predictions. Both predictions are 
lower than, but consistent with, a recent quenched lat- 
tice gauge calculation of T D *{1) = 0.935 ± 0.033 Q. 
According to the recommendation of the LEP working 
group, we use T D * (1) = 0.91 ± 0.04. 

The magnitude of the V u b CKM matrix element can 
also be measured either via inclusive or exclusive tech- 
niques. The measurement of the exclusive branching frac- 
tion for B — > plv is related to | V u b \ according to Ref. : 

B(B^ ptv) = \V vb \ 2 -fpiv-TB. (5) 

A similar relation holds for the extraction of V u b from in- 
clusive branching fraction measurements. The predicted 
reduced exclusive decay rate, T p ^, depends on form fac- 
tors which have been estimated in various models |l6| . 
The predicted inclusive decay rates are calculated in the 
heavy quark expansion .17]. The extraction of V u b from 
exclusive (inclusive) branching fraction measurements in- 
troduces a theoretical uncertainty of 15% (10%). In the 
{p, fj) plane a range of |Kifc/V^f,| appears as a circular 
band centered at (0,0). 

The CKM element Vtd may be extracted from the 
B°B° oscillation frequency 

Q2 

Am B d = ■^ J 2VBmB d m 2 v S Q (xt)f' Bd BB d | V td V t * b | 2 , (6) 
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where t]b = 0.55 ± 0.01 is a QCD factor, rriB d the B me- 
son mass, raw the W boson mass, So(xt) the Inami-Lim 
function |ig for the box diagram, x t — m^/rn^ the ra- 
tio of top-quark mass to W boson mass squared, fg d 
the B meson decay constant, and Bg d the so-called bag 
factor. The estimated theoretical uncertainty introduced 
through fs d \/BB d is of the order of 20%. In the (p, fj) 
plane a range of \Vtd/V c b\ appears as a circular band cen- 
tered at (1,0). 

The CP- violation parameter \ex\ in the K°K° system 
provides another constraint in the (p, fj) plane: 

\e K \ cx B K Xm(V td V*) 

x {He(V cd V* a )[r]iSo{x c ) - rfySo(x c ,x t )] 
~TZe(V td V t * s ) m S (x t ) . (7) 

Here the geometrical representation is a hyperbolic band 
in the (p, fj) plane. 771,772,773 are QCD parameters 0, 
[20L l2l| ; x c = m 2 . I m^y is the ratio of charm-quark mass 
to W boson mass squared, and So (2;) are the Inami- 
Lim functions 18] for the electroweak box diagrams with 
charm and top quarks in the loop. For the central val- 
ues of the running top and charm quark masses used 
herein, S (x t ) = 2.45, S (x t ,x c ) = 2.31 x 10~ 3 , and 
S (x c ) = 2.62 x 10- 4 . 

Mixing in B s mesons provides a means to measure the 
CKM quantity |V^Vt s |. Similarly to Equation © for 
ArriB d , the B s oscillation frequency is related to |V t JV ts | 
by: 

Am Ba = p^^msm^f^BB^ 2 S (x t )\V t * b V ts \ 2 , (8) 



where £ = (/s s \fB Bs ) / (/s d \/B Bd ) ■ Experimentally, a 
lower limit on Amg a has been determined by combin- 
ing analyses of different experiments using the amplitude 
method |22J. For given values of £ and f Bd B Bd , this leads 
to a lower limit on |V^Vt s |. A measurement of IV^/V^I 
would yield another circular band in the (p, fj) plane cen- 
tered at (1,0). We can, at present, constrain only the 
upper side of this band. 

The measurements considered up to now are related 
to the sides of the Unitarity Triangle. Additional mea- 
surements, such as of the angles a,/?, 7 of the Unitar- 
ity Triangle, will provide additional constraints on the 
CKM parameters. Measurements of sin 2(3 are now rather 
precise H3 HJ |2fj: the world average is sin 2(3 = 
0.731 ±0.055. This angle is related to the CKM elements 
by 0: 

sin 2/3 appears as a set of rays that cross the point (1,0) 
in the (p, fj) plane. 



FITTING THE DATA 

We first describe our approach to the global fit in con- 
ceptual terms. The fit begins with a set of experimental 
observables Ei, which can be predicted on theoretical 
grounds. The inputs to these predictions are: the CKM 
matrix, assumed 3x3 and unitary, represented in terms 
of the Wolfenstein parameters Wj', a list of quantities 
Cfc, such as masses and lifetimes, with experimentally de- 
rived or other probabilistic uncertainties o~c k ; and a set 
of "theoretical parameters" Tg. The parameters Tg are, 
in principle, exactly calculable from the theory and the 
Cfe, but this is in practice difficult to do, and the choices 
of approximation methods, cutoffs, and the like intro- 
duce non-probabilistic uncertainties Ax t which arise, in 
essence, from human judgment. As it becomes possible 
to use lattice calculations and other quasi-statistical tools 
to evaluate the Tg, these uncertainties will be replaced by 
probabilistic ones. 

The predictions for the Ei take the functional form 

(E i )=S i (W j ]C k ;T t ). (10) 

These predictions are to be compared with experimental 
measurements of the observables, Ei ± cxe; . 

In order to test the consistency of theory and experi- 
ment and determine numerical values for the CKM ma- 
trix elements, then, we wish to perform a fit to the ensem- 
ble of data above. Notionally we would allow all the input 
parameters to vary in the fit, minimizing a consistency 
statistic (a x 2 or likelihood) constructed from the devia- 
tions of the predictions (Ei) from the measurements Ei, 
including their uncertainties, and from the deviations of 
the input parameters from their nominal values, includ- 
ing their uncertainties as well. The Wj, on which there 
is no a priori information, would be entirely free in the 
fit. 

The results of the fit would be an overall optimum 
set of values for the fit variables, including the CKM 
parameters Wj, a confidence level for the best fit, and 
a correlation matrix or set of confidence level contours 
characterizing the possible deviations from the optimum 
point. 

The problem, described previously, is that the uncer- 
tainties Ax t on the theoretical parameters do not in fact 
have a principled statistical interpretation. Thus, con- 
structing the consistency statistic for the full fit would 
require the use of a value judgment on how to proceed in 
the face of this. In a Bayesian analysis, we would impute 
an a priori p.d.f. to each of the Tg, somehow based on the 
At ( , and then proceed with a conventional fit. In the fre- 
quentist RFit approach, the contributions of the Tg would 
be represented as unnormalized "penalty function" fac- 
tors in the overall likelihood function, having value zero 
outside the intervals ± Ar e and one inside. These con- 
strain the fit to the theoretically allowed intervals with- 
out introducing any statistical preferences within them. 
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The fit can then proceed, but the use of these penalty 
functions requires the re-interpretation of the results as 
yielding not confidence levels but upper bounds on them. 

In order to avoid the need to convolute probabilistic 
and non-probabilistic uncertainties, we take a different 
approach. We acknowledge that we simply do not have 
the necessary information on the Tg_ to be able to per- 
form a global fit. We choose instead to treat each point 
in the space T independently, and defer the consideration 
of non-probabilistic uncertainties until the stage of inter- 
pretation of the fit results. The space T can be taken as 
extending well beyond its "theoretically allowed" bounds; 
this is useful in clarifying the interplay between theoret- 
ical and experimental constraints. 

For any given point in this space, which we call a model, 
we fix its Tg values, and then we perform the fit, allowing 
only the Wj and Ck to vary. The result of this proce- 
dure is, for each model in T, an optimum value of the 
fit variables, a value of the consistency statistic (a confi- 
dence level), and a measure of the uncertainties around 
the optimum point. This information can then be put to 
a variety of uses which we have found to shed interesting 
light on our understanding of the CKM data. We discuss 
these below after the presentation of the details of the 
fits as actually performed. 

A prerequisite to this approach is a detailed analysis 
of the uncertainties on each input to the fit, separating 
those which are statistical in nature from those which are 
not and for which there is no principled way to determine 
a probability density function. In some cases this has al- 
ready been done carefully by the authors of the inputs we 
use, but in others we have had to try to separate them out 
ourselves. That isn't always possible — sometimes uncer- 
tainties have been irreversibly convolved — and we hope 
by our work to demonstrate the importance of keeping 
these different sources of uncertainty distinct in published 
results. 

We follow the established convention in high energy 
physics for treating experimental systematic uncertain- 
ties as if they were statistical and approximately Gaus- 
sian. This convention is typically justified by noting that 
experimental systematics tend to be built up from many 
individual contributions that are not strongly correlated. 
Applying a heuristic descended from the Central Limit 
Theorem, this leads to the expectation that it is reason- 
able to approximate their sum as having a statistical, 
Gaussian distribution. Compared to the case of theoret- 
ical uncertainties, there is a longer history of doing this 
and a more uniform consensus on calculating systematic 
uncertainties so that their scale can be interpreted as a 
standard deviation. 

Nevertheless, if in the future we find that a particular 
experiment has a large systematic uncertainty dominated 
by a single contribution, we can choose to break it out for 
treatment similar to that for theoretical uncertainties. 

Once the uncertainties contributing to the fit have been 



identified and cataloged in this manner, we then select 
those non-probabilistic uncertainties that are expected to 
have the largest effect on the overall fit results. This is 
necessary because the methods described below are lim- 
ited in the number of these sources of uncertainty that 
can be studied simultaneously: the computational load is 
exponential in this number. As a result, we must compro- 
mise and adopt approximate statistical interpretations 
for the remaining, smaller, non-statistical uncertainties. 
Iterative application of our method can be used to ex- 
plore the validity of these choices. 

Separate Treatment of Probabilistic and 
Non-probabilistic Uncertainties 

Consider a quantity x having both probabilistic and 
non-probabilistic uncertainties. We may represent this 
as 

x= (x)±a x ±A x , (11) 

where (x) is the nominal central value of x, o~ x represents 
the sum in quadrature of all probabilistic uncertainties 
on x, and A^ represents a non-probabilistic uncertainty, 
typically a theoretical one. While o~ x has its usual inter- 
pretation as the width of a normal distribution, at this 
point we need not commit to any particular interpreta- 
tion of A x — it is merely a placeholder. 

For the purposes of the construction of the fit we re- 
state this schematically as: 

x=((x)±a x ) + (0±A x ). (12) 

That is, we represent it as the sum of a normally dis- 
tributed variable carrying the probabilistic uncertainties, 
and an additive shift with purely non-probabilistic uncer- 
tainties. We could also use an equivalent multiplicative 
definition, where appropriate. 

The value of x actually used in the fit is then computed 
as x = xp + inp , the sum of particular values of the two 
terms above, respectively. The value of xp is allowed to 
vary in the fit, with a contribution to the goodness-of-fit 
statistic representing its probabilistic uncertainty. For a 
X 2 -based fit, this contribution would be 

x% = (^-(-)) 2 (13) 

The value of #np is scanned over a range related to the 
uncertainty A x , chosen as appropriate to the interpreta- 
tion of that uncertainty, with an independent fit for xp 
(and any other fit variables) performed for each value of 
x NP . 

When discussing the results of the scan analysis, in 
order to improve the accessibility of the presentation, we 
compute and display the value 

x = (x) + x NP , (14) 
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thus recentering the scanned values on the nominal cen- 
tral value. 

In some cases, again for the purpose of clarity of pre- 
sentation, we present the results not in terms of the fit 
variable x, but in terms of a transformation of that vari- 
able. 

Thus, for instance, from Equation[5]we see that we can 
compute \V u b\ from exclusive data as: 

|VJ-(M^)». (15 , 

r 'piv is the quantity on which the theoretical uncertainties 
actually arise, and the one treated as the "x" in the fit 
according to the procedure above, with T p i v representing 
the scanned theoretical value. For the purposes of presen- 
tation, however, this variable is somewhat unattractive; 
readers will be more familiar with considering the theo- 
retical uncertainties as applied to \V u b\, and so will more 
readily understand the scanned values when presented in 
that form. 

These are inputs to fits, and must be displayed as such. 
We therefore define \V* b \ as the value of \V u b\ computed 
from the above equation using Y p e. v and the input values 
for B(B — > piv) and tb- When this appears in the results 
below, it should not be confused with the value of \V u b\ 



resulting from the fits, which will in general be different. 

The variables \Vj%\, \V° b x \, and |V^ n | are defined simi- 
larly and used in the presentation of results below. 



Details of the Fit 

Following the procedure described above for dealing 
with the non-probabilistic uncertainties on the theoreti- 
cal parameters Tg, wc define the notion of a "model", a 
specific set of these parameters 

M = {^.(i),r 

BK,£,,m,m,rj3,VB}- (16) 

The fit for a given "model" M. by definition incorpo- 
rates no contribution from the non-probabilistic uncer- 
tainties on these parameters. The parameters arc then 
scanned within designated ranges, performing a fit for 
each resulting model. We may define the range of the 
scan as the theoretically preferred intervals shown in Ta- 
ble[IJ or, for the sensitivity studies present here, substan- 
tially broader one. 

In detail, for each model A4, the \ 2 expression which 
is minimized is: 



x 2 m(Ap,v) 



\V cb T D *(l)\) - A 2 \ i \T D *(l)\ 2 )\ 2 , ({B^u) -r ce „A 2 \ 4 T b V ({B ply ) -r pe „A 2 \ 6 T B o(p 2 + V 2 )Y 



{B ulu ) -T ulu A 3 X 6 T b {p 2 + T] 2 )\ , f (Am Bd ) - Am Bd (A,p,r]) 



+ xkmn. (A p, n) 



(a^K s ) - sin 2/3(p, rj) \ 2 / (/b-/Bb) - Jb^/Bb 



Csin 2/3 



[B K )-B K \ 2 , f{\e K \)-\e K \{A,p,fjy 2 



(0 - £,\ 2 , ( W - A\ , f(m t )~m t \ 2 f{m c )-m c \ 2 ( (m w ) - m w ^ 2 



Jb°) -tb° \ 2 , ( (tb+) - t b + 



(/b+) - /b+ \ 2 , ( (/b s ) _ /b s x ~ 



a f B+ 



°1b 



Or, 



2 / \ 2 

T Bs ) -T Bs \ , ( (T Ab ) -T Ab \ 



(/b+.°) ~ /b+-° 



(17) 



In this expression, the notation () is used to denote the 
experimental input, averaged over experiments. 

The minimization solution (A, A, p, t))m for a particu- 
lar model now depends only on measurement errors and 
other probabilistic uncertainties. We have included in 
the any probabilistic component of the uncertain- 



ties on the theoretical parameters relevant to each par- 
ticular measurement. This is why terms for some of the 
model parameters appear in the x 2 ■ We have also treated 
the comparatively small uncertainties arising from r]2, ?73, 
and r\ B as probabilistic. 

We use the experimental measurements listed in Ta- 
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ble HI While the CKM matrix element |Vtd| is ob- 
tained from the measurement of the B®B® oscillation 
frequency pjif. the CKM matrix elements \V c b\, and 
\V u b\ can be extracted from both inclusive and exclusive 
semileptonic decays. 



Inputs for | V c i | 



The measured inclusive B 
to |y cb | by 



X c lv decay rate is related 



V cb \ = 



IT(B -> X c lv) 



r 



(18) 



SL 



where Tsl is the predicted semileptonic b — > c£i^ decay 
rate. We use the most recent branching fraction mea- 
surements from BABAR, Belle, CLEO, and LEP (see Ta- 
ble [Hj that have been collected by the Heavy Flavor 
Averaging Group. At the T(4S) the semileptonic de- 
cay rate is measured to be T(B — > XJLv) — 0.446 • (1 ± 
0.023 ± 0.007) x 10~ 10 MeV while at LEP a value of 
T(B -► X c lv) = 0.441 • (1 ± 0.018) x 10~ 10 MeV is ob- 
tained. Both values are averaged including the B meson 
and b quark life times, respectively. In our maximum 
likelihood fits we scan the entire theoretical uncertainty 
in TgL, which results by adding the individual theoreti- 
cal uncertainties in quadrature. This amounts to a 5% 
theoretical error in \V cb \- 

In exclusive B — > D*lv decays \V c b\ may be extracted 
from the lepton spectrum at zero recoil \V c b\ x -F(l) 
We use the average value for \V c b\ X ^(l) provided by 
the Heavy Flavor Averaging Group, based on results 
from CLEO, Belle, ALEPH, DELPHI, and OPAL. We 
scan over the theoretical uncertainty of the prediction of 
.Fd*(1) which is presently 4.4%. 



Inputs for \ V u b\ 

The inclusive value of V u b is obtained from five branch- 
ing fraction measurements at the T(4S) and the LEP av- 
erage (see Table IhT|) . Here the theoretical uncertainties 
enter in three ways, in the efficiency, in the branching 
fraction and in the extraction of V u b- In order to in- 
corporate these errors in the scan we adopt the follow- 
ing procedure. We first determine a weighted average of 
the five T(4S) measurements using statistical and sys- 
tematic errors only. Then we determine weighted aver- 
ages for the upper and lower bounds determined by the 
theoretical uncertainties (Bi + Ai,Bi — Aj), where the 
weights are determined from statistical and systematic 
errors only. The difference between the averaged upper 
and lower bounds and the average of the central value 
yield the theoretical uncertainty in the averaged branch- 
ing fraction B — > X u lv measured at the T(AS). We then 



average the T(4S) results and the LEP results using the 
-B-meson and b quark life times. From this value V u b is 
extracted using [lj| 



V uh \ = 0.0445 



B(b^ulv) 



1.55ps 



1/2 



0.002 

x (l±0.02 QCD ± 0.052 m J. 



(19) 



The theoretical uncertainties from the branching frac- 
tion and those from the extraction of |Kf>| are added in 
quadrature and converted into a factor that is scanned. 

Though CLEO and Belle have measured \V u b\ in var- 
ious exclusive decay modes we use the branching frac- 
tion of B° — > p~ iv by averaging the CLEO and BABAR 
measurements (also shown in Table ITTT|) . To incorporate 
the different theoretical uncertainties we also average the 
±1A upper and lower bounds using only statistical and 
systematic errors. The theoretical uncertainty is incor- 
porated into the predicted reduced rate that is scanned 
over in the maximum likelihood fit. 



Incorporating Amj, data 

Including the current information on Ams s in the fit 
poses some special problems. In this lower limit 

at 95% C.L. has been determined by combining analyses 
of different experiments using the amplitude method [22| . 
In this approach, we describe the B s and B s decays with 
the p.d.f.: 



V = - 

T 



1 ±Acos(Am Bs t) 



(20) 



where the amplitude A has been introduced. Experimen- 
tal measurements provide A and its uncertainty 04 as a 
function of Amg s . Comparing the measured amplitude 
to the expected one, we can add a term to the \ 2 func- 
tion 0: 



{A, p, fj) = -21n£ 00 (Am B J, 



(21) 



max( t ' For the non- 



where — 21n£ oc (Ams s ) = max( a ; 
zero values we have in the past used a subroutine pro- 
vided by F. Parodi ,38] that incorporates the available 
experimental information. We required our implemen- 
tation of the function to approach zero as A TOB — > 00, 
reflecting our contention that the experimental data does 
not rule out very large oscillation frequencies. In addi- 
tion, we constrained the function to be nowhere negative, 
as required for a \ 2 interpretation. This procedure may 
be criticized for its ad hoc construction. In addition, 
when using it in fits we have in some cases encountered 
numerical instabilities arising from its multiple minima 
and lack of smoothness in the resulting function. 

Given the difficulties with this approach, we now use a 
somewhat different method for incorporating the Am^ 



8 



TABLE I: Input parameters for unitarity triangle fits. Parameters cited as "Statistical" are allowed to vary in the fit, with 
a probabilistic uncertainty incorporated into the x 2 function defined in Eqn. 1171 Parameters marked "Scanned" are held 
fixed in the minimization procedure, but varied from fit to fit over the non-probabilistic range shown here. Parameters with 
both designations are treated as discussed in the text; those with neither are treated as precise constants. The theoretical 
uncertainties on the QCD r\ parameters are in some cases scanned and in others incorporated into the fit as statistical, as 
described in the text. 



Variable 


Value 


Statistical 


Scanned 


Ref. 


B- and _K"-meson decay properties 


Am Bd 


0.503 ± 0.006 ps _i 


X 




[28J 


Ams, 

£K 


> 14.4 ps" 1 (95% C.L.) 
2.271 ± 0.017 x 10" 3 


X 
X 




f28] 
[29] 


sin 2(3 (B d ~* (cc)K) 


0.731 ± 0.055 


X 




[23, 24] 


6-hadron lifetimes 


TgO 
T~B + 
T B S 


(1.542 ±0.016) ps 
(1.674 ±0.018) ps 
(1.461 ±0.057) ps 


X 
X 
X 




[29J 
[29] 
[29] 




(1.229 ±0.08) ps 


X 




[29] 


6-hadron production parameters 


/b+ = /b° 


0.388 ±0.013 


X 




|29j 


/b+.o 


0.106 ±0.013 
1.04 ±0.08 


X 
X 




[29] 
[29] 


Meson masses 


m Bd 


5279.4 MeV/c 2 
5369.0 MeV/c 2 
497.7 MeV/c 2 






[29] 


m Bs 






[29] 


m K o 






[29] 


Running quark masses 


m t 
m c 


169.3 ±5.1 GeV/c 2 
1.3 ±0.1 GeV/c 2 


X 
X 




[3] 
[29] 


Standard model parameters 


mw 


80.423 ± 0.039 GeV/c 2 


X 




[29] 


G F 


(1.16639 ±0.00001) x 10~ 5 GeV 






[29] 


A 


0.2241 ±0.0033 


X 




w 


B factors and decay constants 


B K 


0.72 - 1.0 ±0.06 


X 


X 


[30] 




211 - 235 ± 33 MeV 


X 


X 


[30J 


C 


1.18 - 1.30 ±0.04 


X 


X 


[30] 


Form factors and reduced decay rates 




0.87 - 0.95 




X 


01 


r ' p tv 


12.0 - 22.2 ps" 1 




X 


[15, 33] 




54.8 - 79.6 ps" 1 




X 


[34J 


Prfi/ 


38.0 - 41.5 ps -1 




X 


w 


QCD parameters 




0.55 ±0.01 




(x) 


[20] 


Vi 
V2 


1.32 ±0.32 
0.574 ±0.01 




(x) 
(x) 


[19] 
[20] 


m 


0.47 ±0.04 




(x) 


[21] 



information into the fit. We start with the formula for 
significance 



5' 



£/B.(l 



2w)e 



-i(Am a <T t ) 2 



(22) 



where N is the sample size, fs a is the B s purity, w 
is the mistag fraction, and a t is the resolution. Wc 
use this formula in an empirical approach, and substi- 
tute symbol C for the expression J ^Jb s {^ ~~ 2io). Our 



use of this formula begins with interpreting S as the 
number of standard deviations by which AmB a differs 
from zero, S = Am^ /<7Am B (a similar result would 
be obtained with the interpretation applied instead to 
1/AwbJ. With this interpretation, we may express a 
contribution to the x 2 from the Abjb i measurements as: 



XAr 



= C 2 1 - 



-(Arn Bg a t ) 2 



(23) 



where A is the best estimate according to experiment. 
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TABLE II: Inclusive branching fraction measurements of B — * X c iv. 



Experiment 


Branching Fraction [%] 


Reference 


T(4S I ) average 


10.70 ±0.28 


f35] 


LEP average 


10.42 ±0.26 


[9] 



TABLE III: Inclusive branching fraction measurements of B — > X u £u and selected exclusive measurements of B(B° — > p e + v e ) 



Experiment 


Method 


Branching Fraction [10 a ] 


Reference 


CLEO 

BABAR 

BABAR 

Belle 

Belle 


endpoint analysis 
endpoint analysis 
hadronic mass 
D<-*Hv tags 

Improved v reconstruction 


1.77 ± 0.115 st „ t ± 0.269 s „ s ± 0.327 fu - stat ± 0.20 /n _ SHS 

2.054 ± 0.189 stat ± 0.189 sys ± 0.388/„_ sta t ± 0.25/ u _ sys 

2.14 ± 0.29 3tat ± 0.25 SHa ± 0.37 6 _ u 

2.62 ± 0.63 3tat ± 0.23 SHa ± 0.05 6 _ c ± 0Al b ^ u 

1.64 ± 0.14 stat ± 0.36 sya ± 0.28 6 ^ c ± 0.22^ 


m 

[32] 
f34] 
[34] 
[34] 


T(4S) average 




2.031 ± 0.215 sta+SJ/s ± 0.31 t h 




LEP average 


hadronic mass/neural network 


1.71 ± 0.31 stat+S!/s ± 0.37j,^ e ± 0.21^ 


[37] 


CLEO 
BABAR 


B { > -» p~e + v £ 
B° -> p~e + v e 


2.17 ± 0.34 stat V %' 4 sys ± 0.01 ± 0.041 t/i 
3.29 ± 0.42 3tat ± 0A7 S ys ± 0-6 th 


[15] 
[33] 


T(4:S) average 




2.68 ± 0A3sta + sys ± 0.50th 





The values of (A, C 2 , at) are chosen to give a minimum 
at 17 ps -1 , and a x 2 probability of 5% at Amg s — 
14.4 ps _1 . This function is plotted in Fig. 2] with the 
— 2 ln£ 00 (Ar7is 3 ) curve described earlier superimposed. 
It may be noted that, in the region of small \ 2 , the two 
functions exhibit similar general features. Deviations in 
the region where both curves have large values don't mat- 
ter much — in that region the fit is poor in cither case. 
We have checked the sensitivity of our results to how 
rapidly the \ 2 rises at low values of Amg^ and find very 
little effect. Thus, we have some confidence that this 
empirical treatment is providing dependable answers. 



Fit results 

A "model" M. and its best-fit solution are kept only if 
the probability of the fit satisfies P(xm) > Pmin, which 
is typically chosen to be 5%. For each "model" M. ac- 
cepted, we draw a 95% C.L. contour in the (p, ff) plane. 
The fit is repeated for other "models" M. by scanning 
through the complete parameter space specified by a re- 
gion broad compared with the estimated theoretical un- 
certainties. 

The x 2 minimization thus serves two purposes: 

1. ) If a "model" M. is consistent with the data, we 

obtain the best estimates for the three CKM pa- 
rameters, and 95% C.L. contours are determined. 

2. ) If a "model" M. is inconsistent with the data the 

probability V^x^m) wm be low. Thus, the require- 




FIG. 1: The dashed blue curve is the empirical \ 2 expres- 
sion used to represent the experimental data on Ams, in the 
fits described in this paper. The solid red curve is an im- 
plementation based on the amplitude analysis method; when 
used in the role of a \ 2 term, the function is truncated to 
non-negative values (see Eqn. l|21|l L 



ment of V(xM)min > 5% provides a test of compat- 
ibility between data and its theoretical description. 



If no "model" were to survive we would have evidence 
of a consistency problem between data and theory, inde- 
pendent of the calculations of the theoretical parameters 
or the choices of their uncertainties. 
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Single-parameter sensitivity studies 

Demonstrating the impact of the different theoretical 
parameters on the fit results in the (p, fj) plane, Figures^! 
and show contours for fits in which only one parameter 
was scanned while the others were kept at their central 
values. These demonstrate the impact of the model de- 
pendence in each of the theoretically uncertain parame- 
ters. 

In these figures, the red contours correspond to vari- 
ation of the scanned parameter within its theoretically 
"allowed" range, while the green and cyan contours re- 
sult from doubling this range, symmetrically to lower and 
higher values. The scanned values of the parameter are 
spaced equidistantly, with nine values covering the cen- 
tral theoretically preferred range and four each the upper 
and lower extensions. If a fit fails the P(xM)min > 5% 
requirement, no contour is drawn. 

For qualitative comparison, we show the boundaries of 
the four bands for \V u b/V c b\, \Vtd/V c b\, sin 2/3, and e K - 
Since the theoretical parameters are kept at their central 
values except for the one being varied, the computation of 
these bands corresponding to the non-varied parameters 
reflects only experimental uncertainties. 

Scan results in the p-fj plane 

We now turn to scanning all parameters simultane- 
ously within their theoretically "allowed" ranges. Fig- 
ures 0] and show the resulting contours for scans in the 
multi-dimensional model space with some selections on 
the included experimental information. Note that there 
is no frequency interpretation for comparing which mod- 
els are to be "preferred" , other than the statement that at 
most one model is correct. In this analysis we cannot, and 
do not, give any relative probabilistic weighting among 
the contours, or their overlap regions. If we wished to do 
so, we would proceed to a Bayesian analysis. 

From Figure [5] we can determine ranges for the CKM 
parameters p, fj, A, and A that are consistent with at least 
one set of scanned parameters. Table Hvl summarizes the 
present results. The ±ler asymmetric experimental errors 
are shown separately. Furthermore, we obtain ranges for 
the angles a and 7, as well as results for m t , m c , and (3 
that are consistent with the input values. 

Extensions to the standard model 

BABAR and Belle have measured the CP asymmetry of 
B — > 4>K® , yielding a combined value of the sine term in 
the CP asymmetry of S^o = -0.39 ± 0.41 Hjj. In the 
standard model (SM) this should be equal to sin 2(3 to 
within ~4%. In models beyond the SM, however, it may 
differ. The present average deviates from sin 2(3 by 2.7a. 



We can use this result in our fits by adding an additional 
phase 9 S . The resulting set of contours in the 9 s -p plane 
is shown in Figure El (left plot) . Presently, the phase is 
consistent with zero as expected in SM. 

Physics beyond the SM may affect B®B® mixing and 
CP violation in B — > J/ipK® and B — ► tttt. Using a 
model-independent analysis we can add a scale parame- 
ter, r c i, for B^Bj mixing and an additional phase, 9d, for 
parameterizing the CP asymmetry in B — > J/tpK®. The 
SM is then represented by = 1 and 9d = 0. The plot 
on the right-hand side in Figure El shows the resulting 
contours in the Qd~ T d plane. With present uncertainties 
Td and 9d are consistent with the SM expectations. 

In order to extrapolate the p-fj plane in the future, we 
have assumed that both experimental errors and non- 
probabilistic theoretical uncertainties are reduced ac- 
cording to projections given in |4l| . In addition, we have 
tuned V u b determined from exclusive and inclusive mea- 
surements to yield a central value that lies inside the 
sin 2(3 band, as with the present central values no fit 
would survive. The resulting contours in the p-fj plane 
are shown in Figured In this case several sets of param- 
eters are consistent with the SM. The main constraint 
is given by sin 2(3. Measurements of 7 and sin 2a are 
necessary to reduce the range along the (3 rays. 

VISUALIZING THE ROLE OF THEORETICAL 
UNCERTAINTIES 

Returning to the idealized scenario discussed above, 
recall that the results of the scan of CKM fits over the 
space T include a mapping of the best-fit value of the 
fit's consistency statistic as a function of the Tg. This 
mapping, derived entirely without the use of theoretical 
knowledge of these parameters, amounts to a summary of 
our experimental knowledge of them, assuming the gen- 
eral validity of the theoretical framework from which they 
arise. It includes the effects of any correlations among the 
Tg arising from the experimental data. 

We can now in principle combine probabilistic and non- 
probabilistic information in a useful way without any 
need for convolution over a priori p.d.f.'s or an equivalent 
procedure: the above mapping, can simply be compared 
directly with the theoretically calculated values and un- 
certainties At,, for the theoretical parameters. 

This is illustrated conceptually, for the case of two pa- 
rameters, in Figure |H1 The curves represent confidence 
level contours in the space T\ , T2 arising from the fit in- 
puts with experimental (and other probabilistic) uncer- 
tainties. The straight lines represent the theoretical in- 
formation and its uncertainties. Our central point is that 
we hold that such a diagram contains an efficient sum- 
mary of all the available information (in a two-variable 
situation), and that one can go no further in distilling 
down the overall consistency of this information with- 
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-0.2 -0.1 0.1 0.2 0.3 0.4 $jj 0.S 



P 

FIG. 2: Contours of different models in the (p, 77) plane, by varying only one theoretical parameter at a time, left to right, top to 

bottom: , V^, V££, V^, Bk, f, /bVBb- In each case, a total of 17 values were scanned: nine values (red contours) spanning 
the region ±A t h around the central theoretical value, where A t h is the estimated theoretical uncertainty of the parameter; four 
values (cyan contours) spanning the range from — 2A t h to — A t h; and four values (green contours) spanning the range from A t h 
to 2 Ath- Note that sometimes fewer than 17 contours appear — this occurs when some of the values of the scanned parameter 
do not give an acceptable fit (less than 5% \ 2 probability). 



out introducing additional a priori information or value 
judgments. 

The challenge is to find a comparable way to display 
and understand the multi-dimensional data set resulting 
from the actual fits. Ideally, if we could display the entire 
space, T, of the selected parameters at once, no a priori 



assumptions about theoretical uncertainties would be re- 
quired at all. The viewer of such a multi-dimensional 
analog to Figure El could see at once the entire set of fits 
consistent with the experimental data, the effects on this 
set of the application of the theoretical bounds, and the 
correlations of these effects among the various parame- 
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FIG. 3: Contours of different models in the (p, rj) plane, by varying only one theoretical parameter at a time, left to right, top 
to bottom: 771, 772, rj3, and t}b- The interpretation of the various contours is the same as in Fig. [2] 




FIG. 4: The result of scanning over all theoretically uncertain parameters where: the data from inclusive \V u b\ and \V c b\ 
measurements are excluded (left plot), and the data from exclusive \V u b\ and \V c b\ measurements are excluded (right plot). 
The black dots show the central values from the fits, and a red contour for each fit indicates the 95% \ 2 probability region, 
according to the experimental (and statistical theoretical) uncertainties. 




FIG. 5: The result of scanning over all theoretical parameters, including information from both inclusive and exclusive mea- 
surements of \V u b\ and \V c b\, with the Am s constraint included in the fit (left) and excluded from the fit (right). 
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FIG. 6: The results for fits that include an extra phase, 8 S , for the CP asymmetry of B — > 4>Kg (left plot), and for a model- 
independent analysis that includes an extra scale parameter for B^B 1 ^ mixing, rd, and an extra phase for the CP asymmetry 
of B — *■ J/ipKg, 8d- Shown are contours in the 9 a -p plane and 6d~rd plane, respectively. 



TABLE IV: Results for selected fit parameters and the angles of the unitarity triangle from the fits in Figure The second 
and third columns show a lower and upper bound obtained from scanning all theoretical uncertainties. The fourth and fifth 
columns show the asymmetric experimetal errors obtained from the fits. 



Variable 


Mean value - A 


Mean value + A 


- a 


+a 


P 


0.1029 


0.3372 


-0.0668 


+0.0264 


V 


0.281 


0.409 


-0.0203 


+0.0337 


A 


0.796 


0.847 


-0.024 


+0.027 


A 


0.2231 


0.2251 


-0.0032 


+0.0032 


m t 


168.4 


170.2 


-5.04 


5.08 


m c 


1.267 


1.323 


-0.097 


0.0969 


P 


0.3619 


0.4688 


-0.0458 


+0.108 


a 


1.451 


2.0287 


-0.290 


+0.0941 


7 


0.70598 


1.30 


-0.05655 


+0.144 



_ 1 

«1 

0.9 


Lsin2P 
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FIG. 7: A scan over the allowed range of fits for an envisioned 
future set of improved experimental and theoretical results. 



ters. 

By adding more dimensions to the display for the out- 
put parameters of the fit (e.g., for p and 77), the data 
could be displayed as a hyperspatial surface in whose 
shape all the correlations of the fitted CKM parameters 
with the theoretical inputs would be manifest. Unfortu- 
nately, such a plot is beyond normal human ability to 
visualize directly, so some form of reduction to fewer di- 
mensions is needed. 

This makes it necessary, in effect, to integrate over the 



undisplayed dimensions' parameters. In order for this 
to take into account available theoretical information on 
those parameters, as it must in order to be useful, it is 
here that a priori bounds or p.d.f.'s must enter. In a 
Bayesian approach, the integration would be done over 
an a priori p.d.f. as a true convolution. In a classical 
frequentist approach the integration in effect becomes a 
logical OR over all values of the undisplayed parameters 
that are considered "acceptable" — also an a priori judge- 
ment, of course, but a weaker and less specific one than 
for the Bayesian approach. As noted above, in either ap- 
proach once the integration is performed the effects of the 
a priori input tend to be hidden in conventional displays 
of the fit results. 

We have searched for a method that maximizes the 
amount of simultaneously accessible information while 
minimizing the use and specificity of a priori inputs, de- 
ferring as nearly as possible until the moment of graph- 
ical display any convolution of probabilistic and non- 
probabilistic uncertainties, and thus maintaining a prin- 
cipled separation between these that clearly distinguishes 
the effects on the CKM fit of any a priori assumptions 
about the theoretical parameters. The one we have de- 
veloped is essentially frequentist in its treatment of the 
projections it makes, which we believe best meets our ob- 
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FIG. 8: Ideogram of the overlap of the theoretical bounds for 
two parameters Ti and T2 with a set of confidence levels con- 
tours obtained by scanning a fitting procedure, as described 
in the text, over the space of the parameters. The dashed 
lines show the nominal central values of the parameters and 
the solid lines their bounds. 

jectives, though a Bayesian interpretation of projection 
could also be applied in what follows, mutatis mutandi. 

Methodology 

• Pick two of the parameters T for display. Call these 
the primary parameters, T\ and T%. 

• Pick a third T parameter, the secondary parame- 
ter T s . This parameter is singled out for special 
attention to the effects of projecting over it. 

• Call all the other T parameters the undisplayed pa- 
rameters, Tx- 

• For each point P in the grid of scanned values of 
Ti ® T2, a number of fits will have been attempted, 
covering all the scanned values of T s and the Tx- 
For each P, evaluate the following hierarchy of cri- 
teria against the ensemble of results of these fits, 
deriving for the point a value we call the "Level" , 
an integer between and 5 inclusive: 

1. Define a minimum acceptable value of P{\ 2 )- 
Typically we use 1%, 5%, or 32% for this 
threshold. Did any of the fits for P pass this 
cut? If not, assign Level = and stop; other- 
wise assign Level = 1 and continue. 



2. Did any of the remaining fits lie within the 
theoretically preferred region for all the undis- 
played parameters Tx? If not, stop; if yes, 
assign Level = 2 and continue. 

3. Did any of the remaining fits have the sec- 
ondary parameter T s within its theoretically 
preferred region? If not, stop; if yes, assign 
Level = 3 and continue. 

4. Did any of the remaining fits have T s equal to 
its nominal central value? (That value must 
have been included in the scan grid for this 
to make sense.) If not, stop; if yes, assign 
Level = 4 and continue. 

5. Did any of the remaining fits have all the 
undisplayed parameters Tx also at their nom- 
inal central values? If not, stop; if yes, assign 
Level = 5 and stop. 

• Now display contours of the quantity Level over 
the grid in the T\ % T2 plane. We assign a unique 
color to each parameter T, so the contours for T s 
at Level = 3,4 are drawn in the color corresponding 
to that parameter. The contours for Level = 4,5, 
which correspond to restrictions of parameters ex- 
actly to their central values, are also drawn distinc- 
tively, with dashing. 

The Level 3 contour (solid, colored), in particular, 
displays the allowed region, at the selected confi- 
dence level, for T\ and Ti, based on the experi- 
mental data and on limiting all other theoretical 
parameters to their preferred ranges. Study of the 
relative spacing of the Level 2, 3, and 4 contours 
readily reveals the effects of the application of the 
T s bounds on the fit results. 

• Overlay the contours with straight lines showing 
the theoretically preferred ranges and nominal cen- 
tral values for T\ and Ti, in their respective unique 
colors, again with dashing for the central value. 
This allows the theoretical bounds on T\ and Ti 
to be evaluated directly for consistency against all 
other available data, yet avoiding any convoluted 
use of priors for these two parameters. 

Comparison of these theoretical bounds for T\ and 
Ti with the Level 3 contour that shows the experi- 
mental information, constrained by the application 
of the theoretical bounds on T s and the Tx, al- 
lows a direct visual evaluation of the consistency 
of all available information, with the effects of the 
application of all theoretical bounds manifest, not 
obscured by convolutions performed in the fit itself. 

• Cyclically permute the set {Ti, Ti, T s } right twice 
and repeat the previous procedure for the resulting 
two permutations. Taking all three of the resulting 
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plots together, then, each pair of parameters in the 
above set will have been used as the primary pa- 
rameters once, and each parameter will have been 
used as the secondary once. 

• Stitch the three resulting plots together, along their 
common edges, onto three faces of a cuboid volume 
and display the result as a three-dimensional image. 
The consistent use of color allows the effects of the 
theoretical bounds on all three parameters to be 
understood together, despite the large amount of 
information in the plot. 



The basic two-dimensional visualization 

We first illustrate the concept with a single two- 
dimensional plot resulting from the procedure described 
above. Figure shows the results of this procedure for 
the standard inputs used in this paper, for simplicity us- 
ing only exclusive measurements for \V u b\ and \V c b\- The 
primary parameters T\ and Ti are Bk and V c b, shown in 
blue and green, respectively. The orthogonal bands show 
their theoretically preferred ranges and nominal central 
values. 
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FIG. 9: Contours of the "Level" quantity, defined in the text, 
in Bk versus V c b, calculated for a global CKM fit. From 
outside in, the contours are for Level=l) the fit probability 
cut P{x 2 ) > 0.32 (in solid black), 2) restricting the "undis- 

played parameters" £ and /bVBb to the ranges [1.18, 1.30] 
and [211,235], respectively (solid black), 3) restricting the 
"secondary parameter" V u b to the range [0.0028, 0.0038] (solid 
red), 4) restricting it to its nominal central value of 0.00332 
(dashed red), and 5) similarly restricting £ to 1.24 and 

/bVBb to 0.223 (dashed black). The orthogonal bands show 
the theoretically preferred ranges and nominal central values 
for Bk (in blue) and V c b (in green). The fit uses only infor- 
mation from exclusive measurements of \V u b\ an d \V c b\. 



Recall that the symbols X, as previously defined, rep- 
resent the non-probabilistic part of our lack of knowledge 
of the true parameters X. 

The secondary parameter T s is V u b, and the undis- 
played scanned parameters T\ in this case are £ and 

JbVBb- In this fit, and the others in this section, the 
QCD r\ parameters are incorporated into the fit with 
probabilistic uncertainties, not scanned, as a simplifica- 
tion. 

We have performed the fits scanned over a grid of all 
five of these variables, expanded by a factor of five from 
their theoretically preferred ranges. A strength of the 
present method is precisely that it permits the ready vi- 
sualization of the results of going outside the nominal 
bounds. 

The contours enclose successive values of the "Level" 
quantity, as defined above. The outermost, solid black 
contour surrounds the region of Level 1, in which fits 
are required only to pass a P(x 2 ) > 0.32 cut; the next, 
also solid black contour surroundsthe Level 2 region, 
which now limits fits to those with the "undisplayed pa- 
rameters" £ and /bV-Bb within their nominal theoret- 
ical ranges. The third contour surrounds the region of 
Level 3, where the "secondary parameter" V u b is limited 
to its nominal range, and is drawn in a solid color, in this 
case red. 

This contour represents our derived experimental 
knowledge of Bk and V c b given the theoretical con- 
straints on the other parameters. It can be compared 
with the theoretical contraints on Bk and V c b, show as 
the solid colored orthogonal bands. In this case, they 
overlap perfectly: thus we can see immediately that, 
given the inputs to these fits, our present experimental 
constraints on these parameters are strictly less restric- 
tive than the present state of theoretical knowledge. 

The dashed contours display the fit results when the 
theoretical inputs are constrained to their nominal cen- 
tral values; first, at Level 4, for V u b fixed to 0.00332, 
drawn in dashed color, and finally at Level 5 fixing all the 

remaining parameters (£ to 1.24 and JbVBb to 0.223), 
shown in dashed black. Note the consistent use of dash- 
ing to represent central values of the parameters. 

The fact that the dashed lines for Bk and V c b cross 
within the innermost dashed contour is a representation 
of the fact that a successful fit (P(x 2 ) > 0.32) is obtained 
even when all the scanned parameters are set to their 
central values — the set of all the nominal central values 
is self-consistent. 



The three-dimensional triplet plot 

The full power of the method becomes apparent when 
carrying out the last step in the program described above 
and considering the three-dimensional plots constructed 
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by assembling the three two-dimensional plots for the 
pairwise cyclic permutations of a set of three variables. 
Figure ITUk shows the results of this for the same set of 
variables as in Figure El thus, the latter figure appears 
as one of the faces of the cuboid in Figure ITUk. 

The consistent use of color and texture coding is im- 
portant to the comprehension of the plots. Most impor- 
tantly, in each face of a plot, the colored contours show 
the effect of the theoretical restrictions on the parame- 
ter displayed perpendicularly to it — and the same line 
styles are used to display the restriction itself on the per- 
pendicular planes. 

This graphical presentation clearly exhibits the corre- 
lations among parameters and measurements. For exam- 
ple, it is possible to tell by inspection in Figures ITUh.c.e 
that experimental data from exclusive measurements is 
coming to constrain the low side of \V u b\ as much as or 
more than the inputs from theory, with inclusive data 
similarly constraining the high side. 

Further, there is now a region of low \V u b\ versus high 
| Vcb | that is excluded on experimental grounds alone, 
without the need for any of the theoretical inputs here 
considered. In general, the correlation forced on \V u b\ 
versus \V c b\ by the experimental data is readily appar- 
ent. 

We can observe that £ is entirely unconstrained by 
experimental data within a range more than five times 
larger than its present theoretical uncertainty. Thus, the 
theoretical input on £ is seen to be of particular impor- 
tance. 

Bk is seen to be constrained both from above and 
below by the combination of experimental data and the 
theoretical input for \V c b\, to a region two to three times 
larger than the present theoretical uncertainty. This can 
be seen as a measurement of this quantitity, albeit one 
still dependent on the theoretical input for \V c b\- 

These are not altogether novel observations — they fol- 
low from the structure of the underlying theory, though 
the details are a quantitative matter — but we believe 
that this visualization method demonstrates them partic- 
ularly clearly. It seems a very useful tool in understand- 
ing the other CKM analyses, with their typically opaque 
convolution over the theoretical uncertainties. 

It also permits the use of physical judgement in evalu- 
ating the meaning of theoretical uncertainties: should a 
future version of one of these plots reveal an inconsistency 
of model and data, readers concerned about the role of 
theoretical uncertainties in that inconsistency could sim- 
ply read it off a figure, noting that, say, a modest shift 
in a particular parameter could resolve the discrepancy. 

Extending to p and fj 

The method can be extended to include the Wolfen- 
stein parameters p, fj, and A, or any other results of the 



fit, in the visualization of the results. Presentation in 
terms of p and fj, in particular, permits making a con- 
nection with the most common displays resulting from 
other CKM fitting techniques in the literature, and al- 
lows direct visualization of the effects of various theoret- 
ical bounds on the values of the Wolfenstein parameters. 

However, for each of those parameters that appears 
in a graph we must remove one of the theoretical pa- 
rameters into the less readily understood "undisplayed 
parameters" set and project over it, using some sort of 
convolution over its assumed distribution. This conflicts 
with the original rationale for our method, which is to 
minimize as much as possible the use of hidden a priori 
information, as in these projections. Thus the extension 
of these visualization methods to the Wolfenstein param- 
eters represents a compromise between objectives. 

With that in mind, the methods we have developed 
nonetheless can be used to make triplet plots including 
one, two, or even all three of the Wolfenstein parameters. 
Since they are outputs of the fit, they are continuous 
variables and do not form a grid the way the scanned 
parameters do. We simulate this by rounding them off 
to a grid — in effect, by binning them — before generating 
the contours for display. 

Because the Wolfenstein parameters do not have in- 
dependent theoretical bounds that can be used as cuts 
or displayed in a graphic, they cannot serve exactly as 
the "secondary parameters" in the scheme as described 
above. So, when one of them does appear in that po- 
sition, we treat it as if it were a secondary parameter 
without any cuts to be applied to it, and show only the 
Level 1, 2, and 5 contours in the plane to which it is 
orthogonal. While we also cannot show the solid-color- 
edged band that we use for theoretical bounds, but we 
do show a nominal value for each as a colored, dashed 
line, purely as a guide to the eye. 

Figure 1X71 shows the results of applying this technique 
to displays of a single theoretical parameter against the 
two Wolfenstein parameters p and fj, with one plot shown 
for each of a selection of three of the five theoretical pa- 
rameters scanned, Bk, V c b, and V u b- 

In each of the graphs, the p-fj plane displayed at the 
right rear of the three dimensional volume shows the ef- 
fects of including only fits with P(x 2 ) > 0.32 (outer black 
countour, Level 1), limiting the graph's four undisplayed 
parameters to their theoretical bounds (inner black con- 
tour, Level 2), and limiting the graph's displayed the- 
oretical parameter to its bounds (solid colored contour, 
Level 3), and to its nominal central value (dashed col- 
ored contour, Level 4). The nominal values used for the 
"guide lines" in the figure are p = 0.2 and fj = 0.3. 

Strikingly clear from these figures is the power and 
the independence from theoretical inputs of the sin 2/3 
constraint. In each plot in Figure ITTI a clear radial band 
from (/?, fj) = (1,0) corresponding to this constraint is 
visible, even for the outermost contour, which reflects 
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FIG. 10: "Triplet plots", as defined in the text, for, top to bottom, various sets of three of the five scanned parameters and, 
left to right, for fits using only exclusive or only inclusive measurements of \V u b\ and \V c b\- Each triplet plot is a combination 
of graphs of the kind shown in Figure [5] for all three combinations of a set of three theoretical parameters. The fit probability 
requirement remains P(x 2 ) > 0.32, and the theoretical uncertainty ranges and nominal values are those given elsewhere in the 
text. 
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FIG. 11: "Triplet plots", as defined in the text, showing the dependence of the fit outputs p and r\ on a single scanned 
theoretical parameter: from top to bottom, Bk, V c b, and V u t, and from left to right, for fits using only exclusive or only 
inclusive measurements of \V u b\ an d \Vcb\- The dashed lines drawn for p and fj are at purely indicative values of 0.2 and 0.3, 
respectively, and are there only to help guide the eye. The fit probability requirement remains P(x 2 ) > 0.32, and the theoretical 
uncertainty ranges and nominal values are those given elsewhere in the text. 



19 



scanning over all theoretical ranges enlarged by a factor 
of five. 

Also readily seen is the relative role of the precision 
of the predictions for each theoretical parameter in con- 
straining p and fj, especially radially. This can be ob- 
served by noting the difference in size of the Level 2 (sec- 
ond solid black) and Level 3 (solid color) contours. The 
former shows the effect of applying our theoretical knowl- 
edge of all but the selected parameter, and the latter that 
of adding to that our knowledge of that parameter. 

Thus, for instance, in comparing Figures II lb . II 1H . 
and II IF. based on fits to inclusive \V x b\ data, it is easy 
to see that theoretical input on Bx plays the largest role 
in constraining p and fj, while the theoretical input for 
V c b (for inclusive measurements) has little effect once the 
other theoretical parameters have been determined. (The 

corresponding plots, not included here, for /bV Bb and 
£ show that they have comparatively small roles as well.) 



CONCLUSION AND FUTURE DIRECTIONS 

This work is already beginning to shed light on the 
sensitivity of CKM fits to assumptions regarding the pa- 
rameterization of theoretical uncertainties. We believe 
that in the future it will become a valuable tool for under- 
standing the results of such fits and for assessing whether, 
in fact, inconsistencies with the standard model may have 
emerged as the precision and breadth of measurements 
are improved. 

By clearly separating the probablistic uncertainties as- 
sociated with measured quantitities from the a priori un- 
known distributions of theoretical quantities, we have at- 
tempted to expose the effects of assumptions about the 
distribution of theoretical uncertainties on the results of 
fits that purport to extract a statistic representing the de- 
gree of "self-consistency" of the CKM sector of the stan- 
dard model. In particular, the clear correlation found 
between "best-fit" values for p and fj and the theoreti- 
cal inputs for Bx and the computations of \V c b\, which 
are quite model dependent, should prompt a cautious 
approach in future attempts to ascertain the degree of 
consistency of unitarity triangle-based investigations of 
the CKM matrix. 

The existence of such correlations demonstrates the 
dangers of constructing contours covering a range of val- 
ues of theoretically uncertain parameters that may then 
be interpreted as contours of equal probability. 

In the next decade, as lattice calculations of many of 
the theoretical quantities that enter into CKM unitarity 
triangle studies improve and gain control of systematic 
uncertainties, it will be possible to assign a probability 
distibution function to theoretical uncertainties in a prin- 
cipled manner. Until we reach this nirvana, however, an 
explicit display of the effect of the (unknown) distribution 



of theoretical errors on conclusions about the validity of 
the standard model will remain important. 
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